High Performance Parallel Database Processing and Grid Databases
نویسندگان
چکیده
data parallelism for a decision tree, 489–492 data set structure, 479–480 decision tree algorithm, 480–481 decision tree classification, 477–480 processes, 480–488 structure, 478–479 result parallelism for the decision tree, 492–495
منابع مشابه
Database Placement on Large-Scale Systems
Large-scale systems such as Grids offer infrastructures for both data distribution and parallel processing. The use of Grid infrastructures is a more recent issue that is already impacting the Distributed Database Management System industry. In DBMS, distributed query processing has emerged as a fundamental technique for ensuring high performance in distributed databases. Database placement is ...
متن کاملDisk Allocation Methods for Parallelizing Grid Files
The grid file [1] is a well known access method for multi-dimensional and spatial data. The response time needed to process path and range queries on the grid file access method can be improved significantly by distributing the data pages over multiple disks. This paper explores the disk allocation methods used to allocate the data pages of grid file among a set of disks, which can be accessed ...
متن کاملDatabase Support for Data-Driven Scientific Applications in the Grid
In this paper we describe a services oriented software system to provide basic database support for efficient execution of applications that make use of scientific datasets in the Grid. This system supports two core operations: efficient selection of the data of interest from distributed databases and efficient transfer of data from storage nodes to compute nodes for processing. We present its ...
متن کاملA Framework for Parallel Query Processing on Grid-Based Architecture
With relations growing larger, distributed, and queries becoming more complex, parallel query processing is an increasingly attractive option for improving the performance of database systems. Distributed and parallel query processing has been widely used in data intensive applications where data of relevance to users are stored at multiple locations. In this paper, we propose a three-tier midd...
متن کاملA Parallel Implementation of Homogeneity Analysis
Advancements in processor performance during the last decade, along with a growing commercial demand for high performance computing, has contributed to the proliferation of parallel processing [8]. One factor driving this demand is the increasing size of databases. In order to analyze such large databases, using sophisticated statistical techniques, statisticians may find themselves at a loss f...
متن کامل